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The present invention relates generally to the transmission or broadcasting of 
digital information to a wide base of subscribers, and particularly to methods and 
systems for providing access by many subscribers to an almost unlimited amount 
of information and/or programming material. 

5 

BACKGROUND OF THE INVENTION 

The goal of computerized information servers is to provide a system that 
encourages use by subscribers, provides access to a large range of information, 
10 and which if . flexible and inexpensive for information publishers. Computerized 
information services which exist today include Prodigy, Compuserve, and Dialog. 
Typically, the user either pays a monthly fee or a connect time fee for the most 
basic service and additional or special services are provided for additional fees. 

1 5 All of the above mentioned computer services require two way communications 
between the user and the information server in order to service each new request 
by the user. In other words, the user sends requests, and then selected data 
is transmitted individually to that user in response to his/her request. This two 
way "query based" communication scheme is also prevalent in other types of 

20 information servers. For instance, the "pay per view" services provided by cable 
television companies allow a subscriber to call the company and request a movie. 
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The company then sends a signal to the subscriber's television signal decoder 
to enable it to decode the program on a particular channel during the period of 
time associated with the requested movie. In this case, the "information" (a 
program or movie) is being transmitted in any case, but subscribers are only 
5 provided access to the program if they pay a special fee. 

Two major distinctions between the present invention and the prior art query 
based information servers are (1) the present invention has a bandwidth that 
is thousands of times of greater than the prior art systems, thereby enabling high 

10 speed, low cost distribution of information, and (2) the present invention greatly 
reduces the amount of two-way communication required between subscribers 
and the information server by automatically transmitting information that responds 
to most anticipated "requests" by subscribers. Only those subscriber requests 
not satisfied by the automatically transmitted information need to be conveyed 

15 to the information server. 

The goal of the present invention is to provide widespread, high speed access 
to a virtual onniscient database having typically well in excess of a terabyte (10 12 
bytes) of data. Using a data channel capable of transmitting, say, 1 .5 megabytes 

20 of data per second, the present invention can provide virtually instantaneous 
access to about a 1 00 megabytes of information, high speed access (e.g., within 
two hours) to about a gigabyte of information, and can provide medium speed 
access, with perhaps 12 or 24 hour turn-around, to perhaps 100 terabytes of 
information. Furthermore, such access can be provided to a very large set of 

25 users without having to use a large number of data channels. 

The large bandwidth of the present invention's information server enables the 
system to provide subscribers with multimedia programming, including video and 
audio programming. This is unlike current query based information servers, 
30 whose low bandwidth makes the distribution of video and audio programming 
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impractical. The ability of the present invention to distribute information in 
multimedia form makes it much more attractive to both subscribers and potential 
information publishers, including advertisers. 

5 It is important to note that while a user has access to perhaps a terabtye, or even 
100 terabytes or more, of data, the total amount of data that systems in 
accordance with the present invention system can transmit in any one day is 
much more limited, as will be described below. This is not unlike visiting the 
main library of a major university, such as Yale or Harvard University, having 

1 0 stacks containing several million volumes of books. Having "access* to all those 
books every single day does not means that a user can receive them all in one 
day, nor does it mean that all the users can receive all the books in a single day. 
Nevertheless, > >ach particular book (file or program) is available on relatively short 
notice, and having access to such a large collection of books (data) is still very 

15 useful. 

The present invention also has tremendous cost advantages compared with 
information distribution using compact disks (CDs), and is believed to have 
distribution costs on the order of one thousand times less than any competing 

20 distribution media. The present invention eliminates both the need to generate 
■master* liisks and the need to manufacture any physical media. Since 
distribution is accomplished by broadcasting, the actual costs for "publishing" 
a million copies of an average length novel (i.e., distributing it to a million 
subscribers) would be on the order of just a few dollars, versus a cost on the 

25 order a dollar or more per copy for printing either CDs or traditional books, plus 
similar costs for the physical distribution, resulting in traditional distribution costs 
totalling millions of dollars. Furthermore, updates to previously distributed 
information are also distributed at equally low cost using the present invention, 
in contrast with CD based publishing which requires mailing or otherwise 

30 physically producing and distributing new CDs to all previous purchasers. 
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SUMMARY OF THE INVENTION 

In summary, the present invention is an information broadcasting system which 
provides a large number of subscribers access to a large amount of information 
5 using one or more satellite transmission channels. The broadcasting system 
can also use cable television transmission channels or any similarly structured 
data distribution network. The system has a program supplier station which 
stores an information database and tags all the information in the database with 
indices so as to form a single hierarchical structure which encompasses the entire 

1 0 information database. Portions of the information database are transmitted often, 
at least once per day, in order to provide the basic subscriber with information 
needed to access the remainder of the database. The information provided by 
the basic subscriber service, which will typically include at least 50 gigabytes 
of data, is available to all subscribers without requiring two way communications 

15 between the subscribers and the program supplier station. 

By using a tiered" system for scheduling transmission of the 50 gigabytes or. 
so of information included in the basic subscriber service, as well as an 
■intelligent" subscriber request anticipation scheme for retrieving information 

20 before the subscriber asks for it, the present invention provides a huge number 
of subscribers with reasonably quick access to all the contents of the large 
database. This is accomplished even though only a modest amount of bandwidth 
is used. Furthermore, by reserving a portion of the system's bandwidth for 
satisfying requests for access to information not provided with the basic 

25 subscriber service, timely access to a virtually unlimited amount of information 
can be provided, using the same modest transmission bandwidth, to thousands 
of those subscribers willing to pay additional fees for that service. 

The utility of the present can be further enhanced by using data compression 
30 techniques so as to increase the bandwidth of the information server, and by 
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down loading software, data compression code books and the like along with the 
transmitted information. The present invention also includes the ability to 
automatically update the software used by subscribers' computers by transmitting 
the software updates along with other data transmitted by the information server, 
5 with different versions of the updated subscriber software being transmitted for 
each of the different platforms (i.e., types of CPUs) being used in subscribers' 
computers. 



Additional objects and features of the invention will be more readily apparent 
from the following detailed description and appended claims when taken in 
conjunction with the drawings, in which: 

15 Figure 1 is a .block diagram of a "hyperbroadcast" data transmission system in 
accordance with the present invention, showing a receiving station suitable for 
use by a group of users. 

Figure 2 is a block diagram of a typical receiving station for use by an individual. 



Figure 3 depicts some of the data areas and software routines stored in the 
memory storage area of a subscriber station. 

Figure 4 is a block diagram of a system for rebroadcasting onto a cable television 
25 network the data stream received from a satellite, as well as a local information 
source which can insert data into the data stream in lieu of data from the satellite. 

Figure 5 is a block diagram of a local area network of subscribers. 

30 Figure 6 is a block diagram of a data packet. 



BRIEF DESCRIPTION OF THE DRAWINGS 
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Figure 7 is a block diagram of a data fitter subsystem used in subscriber stations 
for receiving broadcast data and video signals. 

Figure 8 depicts the data structure of the filter list used to select data packets 
5 received by a subscriber station. 

Figure 9 depicts the hierarchical structure of the broadcast data. 

Figure 10 is a block diagram of a "user profile" data structure which specifies 
10 data to be stored and updated in a subscriber's computer. 

Figure 11 depicts the index and timestamp information embedded in the 
broadcast data. 

15 DESCRIPTION OF THE PREFERRED EMBODIMENT 

Referring to Figure 1 , there is shown an information broadcasting or transmission 
system 100 having a single program supplier station 102 which transmits a stream 
of data via a satellite 1 04 to a large set of receiving stations 1 1 0 owned or leased 
20 by subscribers 110. Subscribers are persons, organizations and companies with 
equipment to receive transmissions on one or more dedicated channels broadcast 
by the satellite 104. 

In one implementation of the invention, it is expected that subscribers will have 
25 access to the basic service without charge. This basic service provides unlimited 
access to a base set of information, which in the preferred embodiment is about 
fifty gigabytes of public service information, including information such as 
newspapers, weather reports and the like that are updated frequently. 
Subscribers also have free access to all information on the service which 
30 advertisers or other suppliers of paid programming, such as governments and 
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schools, have provided for distribution to the general user community. This 
access may include the ability to receive the broadcasted advertising information 
as well as to make special telephone requests for additional information. Multiple 
channel versions of the system, which will include additional programming on 
5 separate transmission channels, will provide unlimited access to an even larger 
base set of information. In the preferred embodiment, all broadcast channels 
will include a mix of text, graphics, multimedia, audio, video and software 
programming, although the mix of programming materials will typically differ from 
channel to channel. 



Subscribers who request transmission of data not included in the basic service 
could be charged for accessing that information. Information of this type is such 
that the provider of the information (e.g., a person or company owning copyrights 
on the information) has placed restrictions on how the information may be 

15 distributed. For example, the provider may require a transaction fee or royalty 
be paid before providing the subscriber with access codes to find or decrypt the 
information. Alternately, access to a set of information may be limited by a 
provider's requirement that he know who is receiving it. In either case, the 
subscriber, or his computer system, requests such information by calling the user 

20 request processor at a particular telephone number (toll free or otherwise) and 
specifying the information that the subscriber wishes to receive. The user is then 
given access codes and possibly decryption keys to enable the user to access 
the requested information. 

25 Alternately, subscribers could be charged a monthly fee for the basic subscriber 
service. Access by nonsubscribers, in systems which charge for the basic 
subscriber service, would be blocked either by encrypting the transmitted data, 
or by giving only subscribers the packet ID required to access the root information 
required to access the rest of the database, as will be described in more detail 

30 below. 
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The database 1 12 of information provided by the program supplier station 102 
is indexed in a hierarchical fashion, providing keyed or indexed access to all data 
within the database 1 1 2. The database 1 12 will typically have at least a hundred 
times more data that is available for retrieval than the data which is provided 

5 with the basic service. Subscriber requests for additional information are 
transmitted by conventional modems and telephone lines to the program supplier 
station 102, after which transmission of the requested information is scheduled 
for delivery (i.e., transmission) in the portion of the satellite's considerable 
bandwidth that is not used for transmitting the regularly scheduled basic 

1 0 programming. In the one-channel preferred embodiment, the system can transmit 
32.4 gigabytes of specially requested data per day, which is expected to be 
sufficient to serve a subscriber base of approximately one million home users 
or several tens of thousands of commercial users. See Table 1 below. Additional 
transmission channels can be added as the subscriber base increases in size, 

15 and/or different satellite transmission channels can be used for different 
geographical areas. 

As will next be described, the present invention uses a "tiered" system for 
scheduling transmission of the 50 gigabytes or so of information included in the 

20 basic subscriber service, as well as an "intelligent" subscriber request anticipation 
scheme for retrieving information before the subscriber asks for it. Together, 
these features of the present invention provide the system's subscribers with 
virtually instantaneous access to an large information database while using only 
a very modest amount of transmission bandwidth. Furthermore, by reserving 

25 a portion of the system's bandwidth for satisfying requests for access to 
information not provided with the basic subscriber service, timely access to a 
virtually unlimited amount of information can be provided to those subscribers, 
or advertisers, willing to pay additional fees for that service. 
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Overview of Basic Operation 

Referring to Figures 1 and 2, the data stream is transmitted over a single satellite 
channel at a rate of 1.5 megabytes per second in fixed size 10KB (10 kilobyte) 
data packets having a maximum size of 10,000 bytes. The subscriber receiving 
5 stations 1 10 all include a data filter subsystem 120 (see Figure 2) which receives 
the incoming data stream and selects those data packets that meet selection 
criteria defined by the receiving station 110. The information in the selected data 
packets is then downloaded into the memory 122-124 of the subscriber's 
workstation or other computer using software provided by the information network 
10 supplier for that purpose. 



Typically, all subscriber stations are programmed to store a set of "root 
information", comprising about 250KB of information, which provides an index 
to the network's information database 112. Data packets containing this root 
1 5 information are marked to indicate the last time the root information was updated, 
and the subscriber stations are programmed to automatically replace their internal 
copy of the root information as updated information is received. 

In addition, each subscriber can specify additional information from the database 
20 112 that the subscriber wants to receive. All information in the system is 
hierarchically arranged, with the root information comprising the top several layers 
of the hierarchy. As will be described below, subscriber "requests' are made 
by selecting items from the top level menus, or by selecting "markers" in 
previously retrieved data that reference other portions of the network database. 
25 Each such marker includes a packet ID that is used by the receiver to select 
packets to be downloaded to the subscriber's computer, as well as a timestamp 
value indicating (A) whether or not the requested information is included in the 
basic subscriber service, and, if so, (B) the approximate time that the requested 
data is scheduled for transmission. Timestamps are described in more detail 
30 below. If the requested information is already scheduled for transmission 



A-54591/GSW 



- 10 - 

sometime in the near future (either because it is part of the basic service, or 
because another subscriber recently requested the same Kern), the subscriber 
is informed of the expected arrival time. 

5 If the requested data is not included in the basic subscriber service, and if the 
subscriber is authorized to request additional information (i.e., has signed an 
agreement to pay for the transmission of addition data), the request for additional 
information is transmitted by conventional modems and telephone lines to the 
program supplier station 102, after which transmission of the requested 

10 information is scheduled for delivery (i.e., transmission) by the program supplier 
station. If the packet IDs associated with the requested information are not 
already included in the root information available to the subscriber, the program 
supplier station 102 also provides the subscriber with the packet IDs required 
to access the requested information. For information having an associated 

15 delivery charge, the subscriber's account is debited by a charge corresponding 
to the information requested. 

i 

In the preferred embodiment, each request is assigned a priority level (e.g., low 
priority night-time delivery, regular two-hour delivery, and rush half-hour delivery), 

20 and the program supplier station 1 02 schedules delivery of requested information 
accordingly. Information access charges are based both on the volume of 
information requested and on the request's assigned priority level. If the 
requested priority level is not available due to an unexpected surge in requests, 
the subscriber station is informed that the request priority has been downgraded 

25 and the subscriber is given the opportunity to cancel the request. 

Program Supplier Station 

As shown in Figure 1, the program supplier station 102 includes the 
aforementioned database 112, which is typically stored on a large number of 
30 high density magnetic disk devices. The program editing software 132 on the 
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central program supplier station 1 02 tags all the information in the database 112 
with indices (each of which contains a packet ID plus additional information) so 
as to form a single hierarchical structure that encompasses the entire information 
database. More specifically, software 132 (executed by CPU 130) generates 
5 a hierarchical set of indices referencing all the data in the information database 
112 and embeds those indices in the information database. 



In an alternate embodiment, the packet IDs and the associated indices embedded 
in the database can be generated off line. In particular, each information provider 

1 0 can be assigned, in advance, a block of packet ID values. If the packet ID values 
are made sufficiently long, such as 64 or 96 bits long, then assigning blocks of 
packet ID values to information providers will not deplete the supply of such 
values. Each information provider is required to organize the information it wants 
to publish into one or more subtrees, embedding the associated indices therein, 

15 and then providing the information to be published to the information server. 
The information server will then "graft" the subtree(s) of information provided 
by each information provider onto the information hierarchy broadcast by the 
information server. 



20 The program supplier station 102 includes at least one central processing unit 
(CPU) 1 30, software 1 32 executed by the CPU 1 30 for editing program materials 
and for scheduling delivery of both basic service information and subscriber 
requested data. In some embodiments, the supplier station 102 will be a 
distributed system utilizing many CPUs interconnected using a local area or even 

25 a wide area network. In that case, the station 1 02 shown in Figure 1 represents 
the information collection node, which collects all the information from various 
programming material providers, combines, serializes and transmits the 
information. 
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Memory 133 stores, among other things, scheduling data which is used to control 
the timing at which each packet of information in the information database 112 
is to be transmitted. As will be described below, the timing information in the 
schedule data is incorporated into timestamp markers" that are transmitted along 
5 with the transmitted data, enabling subscriber stations to know in advance when 
selected information items will be received. 

User request processor 134 is interfaced to a large number of conventional 
telephone lines 136, using individual modular interface cards for each telephone 

10 line so that the system is expandable. Each interface card is programmed to 
receive and confirm information requests from subscribers, as well as to handle 
subscriber account transactions such as setting up new subscriber accounts, 
providing data decoding keys to new subscribers, and accepting credit card and 
debit card payments of subscriber balances. In response to each information 

1 5 request received, the request processor 1 34 queries a subscriber database 1 38 
to ensure that the subscriber is authorized to request additional data, sends the 
request to the scheduling software 1 32 so that the transmission of the requested 
information will be scheduled, and enters the corresponding charge into the 
subscriber's records in the subscriber database 138. 

20 

In embodiments of the invention using multiple transmission channels, so as to 
be able to handle larger numbers of subscriber requests, special scheduling 
messages are transmitted via the satellite indicating the approximate time that 
the requested information will be transmitted and the transmission channel that 
25 will be used. Subscriber stations waiting for requested information use this 
information to change the transmission channel being monitored so as to receive 
the requested information. 

The actual data transmission is handled by a data formatting and transmission 
30 subsystem 140. This subsystem includes memory for buffering and formatting 
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the data scheduled for transmission, as well as circuitry for encrypting, 
packetizing, and then serializing or streaming the data to a transmitter 1 42 which 
transmits the data stream to the subscriber stations 110 via satellite 104. 

In the preferred embodiment, some of the transmitted data is encoded using 
conventional public key encryption methodology. Generally, decoding keys for 
decoding the encrypted data are given only to subscribers who pay for access 
to that data. For instance, whenever a subscriber requests information not 
included in the basic subscriber service, and for which there is an associated 
delivery charge, the program supplier station 102 may select a new encryption 
key for each request in a pseudo-random fashion, passing the corresponding 
decoding key to the requestor via the same telephone connection on which the 
request is received. Encryption and decoding keys are used in the preferred 
embodiment solely in conjunction with the distribution of materials published on 
a fee-per-purchaser basis. When multiple subscribers request the same set of 
materials prior to their transmission, the same encryption and decoding key can 
be used for all of those subscribers, thereby eliminating the need to retransmit 
the requested materials for each requesting subscriber. 

20 Another type of "encryption" used in the preferred embodiment to defeat access 
by unauthorized subscribers to "protected data", without actually encrypting the 
protected data is as follows. The packet IDs associated with the protected data 
need not be sequential, rather they should be assigned on a pseudo-random 
basis. Furthermore, the packets for the requested data should be scheduled 

25 for transmission in a pseudo-random sequence, interleaved with other broadcast 
data packets. This method of "mixing up" the requested data, combined with 
the fact that at least a hundred 10KB packets are transmitted per second, will 
make unauthorized access to protected data extremely difficult even when the 
protected data is not encrypted prior to transmission. 

30 
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The utility of the present invention can be further enhanced by using data 
compression techniques so as to increase the perceived bandwidth of the 
information server, and by downloading software, data compression codebooks 
and the like along with the transmitted information. Data compression is 
5 discussed more below. 

Subscriber Receiving Stations 

Figure 2 shows an individual subscriber station 110-1 having its own satellite 
dish 150, low noise block receiver and signal converter (LNB) 152, receiver 153. 

10 and data filter subsystem 120. Typically, the satellite dish receives a signal 
having a carrier frequency on the order of 12 GHz. the LNB 152 brings this down 
to about 1 GHz with a signal bandwidth of about 500 MHz, and the receiver 1 53 
then translates the received signal into a digital baseband signal. In this 
embodiment, the subscriber station 110-1 is a desktop computer (such as a 

15 Macintosh computer made by Apple Computer, or an IBM PC compatible 
computer made by any one of a large number of manufacturers) to which the 
satellite receiver 150-152-153 and data conversion subsystem 120 have been 
added. Thus, the subscriber station 1 10 has a CPU 154, monitor 156, keyboard 
158, telephone modem 159 and other conventional computer components. 

20 

As noted above, the data filter subsystem 120 receives the incoming data stream 
and selects those data packets that meet selection criteria defined by the 
receiving station 110. The information in the selected data packets is downloaded 
into the memory 1 22-1 24 of the subscriber's workstation or other computer using 
25 subscriber software 160 provided by the information network supplier for that 
purpose. The data filter subsystem 120 is discussed in more detail below with 
reference to Figure 7. 

The present invention also includes the ability to automatically update the 
30 software used by subscribers' computers by transmitting the software updates 

A-54591/GSW 



|5 



- 15 - 

along with other data transmitted by the information server, with different versions 
of the updated subscriber software being transmitted for each of the different 
platforms (i.e., types of CPUs) being used in subscribers' computers. To 
implement this, the subscriber stations 1 1 0 must be set up to automatically accept 
5 and download software updates of the subscriber software 160. 

Referring to Figure 3, the subscriber's data storage 124 in the preferred 
embodiment includes a data area 161 reserved for downloading information from 
the data filter subsystem 120. Depending on the memory capacity of the 
10 subscriber's data storage devices 124, this data area 161 might be as small as, 
perhaps, 2MB, or as large as perhaps 1 00MB, with a reserved data area of 1 0MB 
being more typical for an individual subscriber. 

One feature of the present invention is that the subscriber software 160 has a 
1 5 set of subscriber request handling routines 1 62 which interact with the data filter 
subsystem 120 to arrange for downloading of the information requested by a 
subscriber. The request handling routines include a special "smart caching" 
routine 163 for anticipating future information requests by the user. Data 
corresponding to these anticipated information requests is downloaded into a 
20 portion 164 of the reserved data area 161 called the smart cache. Since a 
virtually unlimited amount of disk space could potentially be occupied by these 
anticipated requests, only some of which will actually be requested by the 
subscriber, the smart cache 164 is used as a temporary buffer, the oldest 
contents of which will be overwritten as information from new anticipated requests 
25 is downloaded. 

The subscriber software 160 also includes a data decompression routine 165 
for decoding information transmitted in compressed form, a cache management 
routine 166 for controlling the downloading of information into the data area 161 
30 and determining which information in the data area 161 to delete when the area 



A-54591/GSW 



• # 



- 16- 

overflows, and a user profile setup program 167 which compiles a "user profile" 
data structure 300 that defines a default set of information to be retrieved from 
the transmitted data and stored in the data area 161. As noted previously, the 
data decompression routine used can change depending on the nature or type 
5 of data. 

Figures 4 and 5 show two alternate configurations of subscriber stations, both 
of which reduce the cost per subscriber for the required receiver hardware. In 
Figure 4, the received satellite signal is translated by a low noise block converter 

1 0 (LNB) 1 52 to an intermediate carrier frequency of perhaps 1 GHz, which is then 
translated down to baseband by a receiver 153. The receiver 153 outputs a logic 
level bit stream that is received by a cable television (CATV) distribution plant's 
head end 170 and then retransmitted via cable 172 to all the subscribers of the 
cable television system. Subscribers to the information network connect their 

15 computers to the cable television cable 172 using a conventional tuner 174 to 
bring the satellite signal down to baseband. The remaining portion of each 
subscriber's station is the same as shown in Figure 2. Two advantages of this 
system configuration are (1) that individual subscribers save the cost of having 
a satellite receiver 150-152, since one such receiver is sufficient to service a 

20 large number of subscribers, and (2) better reception of the satellite signal, with 
lower error rates, is possible since most CATV head ends have larger receiving 
dishes than can be afforded by individual subscribers. 

An important feature of this configuration is data switch 176, which enables local 
25 programming to be added to the data stream received from the satellite. It is 
anticipated that a certain amount of the satellite's bandwidth will either be left 
unused, for instance by leaving room for 1000 locally generated data packets 
each time that a particular special packet is transmitted, or that a portion of the 
data stream will be designated as suitable for being preempted by local 
30 programming. The local program supplier station 178 is essentially a small 
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version of central program supplier station 102 in that it must perform similar 
tasks - collecting information from various (presumably local) programming 
material providers, combining, serializing and transmitting the information during 
the available time slots set aside for local programming. The local programming 
5 data will also be hierarchically arranged with embedded timestamped indices, 
and will generally be set up as a subtree appended to a branch of the data 
hierarchy transmitted by the central program supplier station. 

The ability to have local programming is commercially important, both because 
10 it provides the ability for local CATV stations to collect advertising revenues, which 

enables the data transmission service to be provided free to subscribers, and 

also to make the content of the transmitted data better tailored to its audience. 

Vastly greater information content is another result of having multiple local source 

transmitters. By way of example, if the local program supplier for each local 
15 CATV system provides one percent of the total traffic, and if there are one 

thousand such local program suppliers, the overall information content of the 

information distribution system is increased ten fold. 

In Figure 5, a single satellite receiver 150-152 is connected to a local area 
20 network 1 80 of subscribers. All requests for transmitted information are handled 
by a single data filter subsystem 1 20 and the network server 1 82. In other words, 
requests for information are routed through the network server 182 so that one 
data filter subsystem 120 can be used to select all the information needed by 
all the subscriber work stations 184 on the local area network 180. Similarly, 
25 a single modem 1 86 or a small number of such modems can be used by a large 
number of work stations 184 to submit requests to the program supplier station. 
A significant cost saving associated with this configuration is saved disk space 
. because all of the users can share access to information stored on the server's 
hard disk. Speed of access to data from the information service may also be 
30 improved, compared to an individual subscriber configuration, because more 
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disk space can be dedicated to automatically caching a larger portion of the 
transmitted information, since the cost of the dedicated disk space is being shared 
by a group of users. 

5 This system configuration also spreads the cost of the satellite receiver 1 50-1 52 
and the cost of the data filter subsystem 120 over the same group of users. 
Since it is anticipated that the data filter subsystem 120 might sell for at least 
$500 (in 1991 U.S. dollars) apiece to manufacture, the local area network 
configuration of Figure 5 can result in significant savings per user. 

10 

The system configuration of Figure 5 is likely to be suitable not only for business 
subscribers, but also libraries, schools, kiosks at shopping malls, and other types 
of institutional subscribers. 

15 Data Filter Subsystem 

As shown in Figure 6, each received data packet 200 begins with a packet 
identifier 202, followed by a function code 204 and a set of data 206, followed 
by error detection codes 208. Furthermore, as will be discussed below, the data 
206 includes not only the information associated with the packet identifier 202, 

20 but also timestamped indices 21 0 for the next N (e.g., three) levels of information 
in the information database hierarchy. 

Packet identifiers 202 are used to determine which packets require further 
processing, with all unselected packets simply being discarded by the subsystem 
25 1 20. The function code 204 is used to identify and categorize special messages 
transmitted by the program supplier. The packet ID 202 and the function 204 
are not encrypted, thereby allowing packets to be selected or discarded prior 
to processing the data 206. In the preferred embodiment, the data 206 portion 
of most packets is not encrypted, except for the use of data compression. A 
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standard, commercially available error correction methodology may be used to 
detect and correct data errors caused by noise in the transmission process. 

In some applications, data packets transmitted in response to subscriber requests 
will be encrypted. However, given the huge amount of data being transmitted 
It is impractical for a "pirate" to scan all the transmitted data for useful information. 
Without knowing the packet IDs for a particular set of information, and the proper 
order of those packet IDs for reconstructing a particular file or program, it is 
virtually impossible to extract useful information from the data stream, thereby 
making data encryption unnecessary in most cases. 

Most or all of the data transmitted by the system will be transmitted in 
compressed form. In order to make the data compression both efficient and 
flexible, a general set of data decompression software is included in the 
15 subscriber software 160 (see Figure 2) given to each subscriber, along with a 
default "code book" or coding algorithm to be used for the decompression 
process. Other code books or coding algorithms, associated with various types 
of transmitted data will be transmitted along with the compressed data. A typical 
code book, for example suitable for encoding newspaper articles, will occupy 
20 less than 50KB. When used in conjunction with megabytes of data, and taking 
into consideration that the compression will typically allow two-to-one or better 
data compression, the transmission of code-books along with the related data 
involves an inconsequential amount of overhead. The data decompression is 
performed on the subscriber's own computer, where computation time is 
25 presumably virtually free of cost. Also, data decompression is not time consuming 
computationally using currently available desktop computers, and will not 
noticeably impact the performance of the system from the viewpoint of 
subscribers. 
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Referring to Figure 7 the data filter subsystem 120 used in subscriber stations 
to receive broadcast data works as follows. The satellite transmitted data stream, 
which contains a sequence of data packets, is first processed by a conventional 
satellite signal recovery circuit 220 that stores the received data packets in a 
5 ring buffer 222. In the preferred embodiment, the ring buffer 222 has 4 
megabytes of storage, enabling about four hundred 10KB data packets or about 
2.5 seconds of data at the system transmission rate to be stored. The required 
size of the ring buffer 222 is governed by the maximum possible delay before 
a selected packet is downloaded onto the subscriber's host computer. 

10 

Each time that a complete data packet is transferred by the data recovery circuit 
220 to ring buffer 222, the data recovery circuit 220 sends a "packet received" 
message to the subsystem's controller 224. The controller 224 responds by 
comparing the packet's ID 202 with a list of packet ID values and/or ranges of 
1 5 packet ID values stored in filter list 226. Packet's which match any of the packet 
ID values in the filter list 226 are "selected" and downloaded to the subscriber's 
computer via a conventional bus interface circuit 234, such as a SCSI (Standard 
Computer System Interface) interface. 

20 To stretch the time available for downloading, a modified ring buffer technique 
can be used in which the buffer 222 is divided into a predefined number of slots, 
each of which can hold one packet. Slots holding packets selected for 
downloading are "marked" (e.g., using a set of bit flags) and thereby prevented 
from being overwritten by incoming data until the packet in the slot has been 

25 downloaded and the slot unmarked. 

If a substantially longer period of time is required to ensure successful 
downloading of the received data to the subscriber's computer, either additional 
random access memory can be added to the ring buffer 222, or a fast disk 
30 storage device can be added to the data filter subsystem 120 to enable larger 
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amounts of data to be buffered by the subsystem 120. The use of disk storage 
device for buffering may be justified for local area networks of subscribers, 
primarily to avoid loss of data when the network server cannot service the 
subsystem 120 within a few seconds. 

5 

In embodiments of the invention which include the transmission of video or audio 
programming material, the video programming material is transmitted digitally, 
in a compressed form. Many video data compression systems are commercially 
available. The video program material stored in the information database 112 

10 is stored in compressed digital form by the program supplier station 102. As 
a result, each data conversion subsystem could optionally include a video 
decompression and modulation circuit 240 for decompressing the compressed 
digitized video data, and then converting it back into an analog signal suitable 
for either playback or recording using a conventional VHS video recorder. In 

15 alternate embodiments, the received video data will be downloaded in 
compressed form, for example for storage on a digital tape recorder. In this 
alternate embodiment, the subscriber's computer will have to include a video 
decompression circuit. This embodiment has the advantage are reducing the 
storage requirement associated with digital video signal storage. 

20 

When multiple transmission channels are used by the program supplier station, 
the controller 224 sends control signals to the satellite receiver as to which 
channel should be selected. Typically, the satellite receiver will normally be tuned 
to a "main" or primary channel. Other channels are only selected during the 
25 expected transmission times associated with packet identifiers stored in the filter 
list. 

Referring to Figure 8, the "requested data" in the filter list 226 is stored as an 
ordinary B-tree data structure, similar to those found in almost any DBMS 
30 (database management system). At the top of the tree is a root node 260. At 
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the bottom of the tree are leaf nodes 266, 268. In between the root 260 and 
the leaf nodes are Intermediate nodes 270. 272, 280. B-trees are well known 
data structures used to quickly identify whether or not a specified key value is 
present in a database. In this case, a three level tree, with perhaps up to twenty 
5 entries per node, will facilitate high speed determination as to whether or not 
a particular packet ID is present in the filter list. 

Each leaf node in the filter list contains five fields: (1) a packet ID, (2) a flag 
indicating whether the packet is provided as part of the basic subscriber service 

1 0 or whether it is a packet that has been specifically requested to be transmitted, 
(3) the identify of the channel on which the packet is to be transmitted, (4) a 
repetition rate value, which is the rate at which transmission of the packet is 
repeated (e.g., onetime per hour, or zero for information specifically requested), 
and (5) a time skew value indicating the approximate time that the packet is 

15 scheduled to be transmitted within the packet's repetition tier. 

To determine the approximate next time that a packet is will be transmitted, the 
repetition rate and time skew fields are used: 

20 Next_Time = Rep_Period * INT{Current_Time/Rep_Period} 

+ Time_Skew 

where INT is the integer function. For instance, if the Current_Time is 09:35, 
the Rep_Period is one hour, and the Time_Skew is 5 minutes, then the next time 
25 that the packet will be received is about 10:05. When a data packet ID in the 
filter list specified a channel other than the main data transmission channel, the 
Next_Time value is used to determine when the subscriber's system should be 
tuned to the other specified transmission channel. 
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Attemately, the packet IDs for the information requested by the subscriber could 
be stored in a Content Address Memory (CAM), thereby providing a 
match/no-match determination for each packet ID In a single clock cycle. 
However, the CAM can store only a limited number of selected packet IDs, while 
5 the number of packet IDs that can be stored in the B-tree structure of Figure 
8 is limited only by the amount of low-cost random access memory provided to 
store the filter list. 

Information Data Hierarchy and Indexing 

10 Referring to Figures 1 and 9, all of the information in the program database 112 
is hierarchically organized using a set of assigned indices to reference each 
distinct portion thereof. For the purposes of transmission, the database is broken 
into data packets each holding up to about 10,000 bytes of data. In general, 
each data packet has a unique packet ID, except that a contiguous and 

15 associated set of packets might be assigned a single packet ID. Sequence 
numbers inside the packets sharing a common packet ID can be used to ensure 
that the data in these packets is properly ordered after downloading into the 
subscriber's computer. The indices associated with reference data in the 
database are included in the root information, and also may be embedded in 

20 various portions of the transmitted data for the purposes of cross-referencing 
related information. 

Referring to the hierarchy or tree data structure 290 in Figure 9, a set of Root 
Information 292 is considered to be at the "top" of the hierarchy. For the 
25 purposes of this discussion, the directions "up" and "down" in the hierarchy are 
as shown in Figure 9. As shown, local programming provided by a local program 
supplier station is typically defined as a subtree of the hierarchy. Alternately, 
the locally provided programming could be appended to several portions of the 
hierarchy 290. 

30 
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Depending on the amount of disk storage space available on the subscriber's 
computer (or network server in the case of networks of subscribers), a certain 
amount of the top portion of the information hierarchy 290 will be automatically 
stored and updated in the subscriber's computer so ihat this information is always 
5 available. Depending on how the subscriber configures his computer system, 
this may vary anywhere from 0.25 megabytes of data to perhaps 10 megabytes 
of data or more. 

As shown in Figure 9, the information database is structured so that all data down 
10 to a certain depth in the hierarchy is provided as part of the basic subscriber 
service, while items further down the hierarchy are available only by special 
request. As shown in Figure 9, the depth of information provided by the basic 
subscriber service may vary in different portions of the hierarchy. 

15 The information database is also hierarchically structured in a second manner. 
In particular, the information included in the basic subscriber service is divided 
into "root information" plus several tiers of information which are transmitted at 
decreasing frequency. Table 1 shows an example of how the 1 .5 megabyte per 
second bandwidth associated with a single channel transmission system may 

20 be subdivided into tiers. The information database's root information, comprising 
about 0.25 megabytes that provides a large part of the indexing and top level 
menus needed to access the information database, is retransmitted ten times 
per hour. The first tier of information, transmitted four times per hour, comprises 
150 megabytes of information most frequently needed by subscribers. Each 

25 successively lower tier of information contains a larger amount of information 
than the next higher tier, prioritized in accordance with actual or expected 
subscriber usage, and is transmitted less often. In this example, twenty-five 
percent of the available bandwidth is reserved for responding to subscriber 
requests for additional data. It is noted that despite the high repetition rate of 
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some of the basic subscriber service data, only 35. 1 % of the available bandwidth 
is occupied by repeated information. 



10 



15 



20 



25 



TABLE 1 

Example of Data Channel Usage 



Information Type 
Root Information 
let Priority Info 
2nd Tier Info 
3rd Tier Info 
4th Tier Info 
5th Tier Info 



Repeat Rate 
10 times/hr 
4 times/hr 
1 time/hr 
0.25 times/hr 
1 time/12 hrs 
1 time/24 hrs 



Subtotals - unique info 

- total bytes Tx 

Data Tx by Re<iuest N/A 

Total 



- 1.5MB/Sec Channel 



Megabytes 
Per Transmsn 

0.250 

150 

600 

2,400 

7,200 

39,540 

49,890.25 
97,200.00 

32,400 

129,600.00 



% of Ofctal 
BarAridtfa 

00.0463 

11.1 

11.1 

11.1 

11.1 

30.5537 

75.00 

25.00 
100.00 



30 It should be emphasized that the repetition rates and other data values provided 
in Table 1 are only one example of how the bandwidth of a channel could be 
utilized. The particular repetition rates associated with each tier of data and the 
amount of data allocated to each tier are selectable parameters that will need 
to be carefully considered in order to maximize utility of the system for most 

35 subscribers. 



40 



Each publisher providing materials to be published by the system will structure 
that information by indicating what portions are to be placed in Tier 1, what 
portions are to be placed in Tier 2, and so on. Different publication costs will 
be associated with each tier. Material published on fee-per-copy basis, wherein 
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a royalty must be paid to the copyright owner on a per copy basis, is transmitted 
only in response to specific subscriber requests. 

5 An alternate tiering scheme to the one shown in Table 1 would be to offer 
publishers a semi-continuous range of repetition rates in the form of a set of 
perhaps 1 00 different repetition rates, each having an associated publication 
cost, with much smaller increments between tiers than the example given in Table 
1 . In addition, publishers would be able to specify time frames during which each 

10 subset Of published information should be broadcast, thereby giving publishers 
a large degree of control over when their programming materials are broadcast. 
Of course, regardless of the repetition rate tiering scheme used, the publisher's 
ability to select any particular repetition rate and distribution time frame would 
be subject to the availability of the required transmission time. Presumably, 

15 supply and demand for time slots in the information server's transmitted data 
stream will be balanced by appropriate pricing policies, much in the way that 
advertising time is sold by radio and television broadcasters. 

User Profiles For Automatic Data Retrieval 

20 Referring to Figure 10, in the preferred embodiment, each subscriber's computer 
compiles a "user profile" data structure 300 which defines a default filter set 
defining data to be stored and updated in a subscriber's computer. In particular, 
the user profile 300 stores a value 302 indicating the amount of disk storage 
space that has been reserved for storing both root information and th6 frequently 

25 requested information specified elsewhere in the user profile. This value 302 
will typically vary from 0.25 to 10 megabytes. A depth value 304 indicates the 
amount of root information to be downloaded into the subscriber's computer, 
and items 306-308 specify portions of the information hierarchy that the subscriber 
wants to have stored and updated on an ongoing basis. For instance, the 

30 subscriber might specify that all information regarding certain sporting events 
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(such as professional football games) be automatically retrieved and downloaded 
into his computer. Software known as "computer agents" might be employed 
by subscribers for the purpose of automatically searching for specified data, 
organizing the results, and so on. 

5 

The data in the user profile 300 is translated into specific packet ID information 
that is then stored in the filter list 226 (see Figure 7) of the subscriber's data filter 
subsystem 120. As a result, transmitted information corresponding to the topics 
specified in the subscriber's user profile 300, is automatically downloaded into 
1 0 the subscriber's computer and is also automatically updated as the transmitted 
information corresponding to those topics is update. By using the "user profile" 
to pre-load the packet ID filter list, the subscriber's wait time for the information 
most frequently requested by that subscriber is reduced to zero. 

1 5 Timestamped Indices and 

"Smart Caching" to Avoid Data Delivery Delays 

Referring to Figure 1 1 , timestamped indices 322 are embedded in the information 
320 transmitted by the program supplier. These indices are embedded not only 
in menus, such as the top level menu shown in Table 2, but are also embedded 

20 inside the text and/or figures of the received information, thereby referencing 
and cross-referencing other data that is available in the database. The structure 
of the indices 322 is the same as shown for indices 266, 268 stored in the filter 
list. It important to note that each index item is timestamped (unless it references 
data that is transmitted only upon request) by the values in the Rep_Rate and 

25 Time_Skew fields of the index. Using the "Next_Time" computation formula 
shown above, the timestamp data in each index allows the user to be notified 
as to the amount of time it will take before a requested item will be received. 
Items In the received information which have an associated timestamped index 
are highlighted or otherwise visually distinguished. When the subscriber selects 

30 the item, the associated timestamp information is displayed, thereby informing 
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the subscriber whether or not this is a special request item obtainable only at 
extra cost, and also the associated delivery wait time if the item is included in 
the basic subscriber service. 



TABLE 2 
Example of Main Menu 

10 PLEASE SELECT TOPIC /CATEGORY: 
TODAY 

, Newspapers & Magazines 

T^Of Financial 
15 Catalogs 

Government 

Telephone Directories 
Travel 

Reference Books and Information 
20 Other Books 

Computer Software & Games 
Audio Programming 
Video Programming 

25 Subscriber Menus 



In accordance with the present invention, there will often be no delivery wait time 
30 for items requested by a subscriber. In particular, whenever the subscriber 
requests an item, the subscriber software 160 in the subscriber's system will 
also enter into the filter list "look ahead" requests for associated information in 
the information hierarchy - typically for the items in the next level or two down 
the hierarchy from the requested item. The extent of the look ahead requests 
35 is limited primarily by the amount of disk storage space available in the 
subscriber's computer. Since it will often take the subscriber a couple of minutes 
to examine the information obtained from a request, the data from the look ahead 
requests will often be received before the subscriber actually requests them. 
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The data corresponding to the 'look ahead" requests entered in the subscriber's 
filter list are stored, when received, on the subscriber's computer's hard disk (or 
other comparable mass storage device). In the preferred embodiment, a portion 
of the subscriber's hard disk is set aside as a "smart cache" (see Figure 3) 
5 specifically for "caching" of these look ahead requests. When and if the 
subscriber requests any of the data in the smart cache, it is available for 
immediate access by the subscriber. If desired, the selected data in the smart 
cache can be moved to other areas of the user's hard disk. Unused data in the 
smart cache is overwritten with new data as required by other received data 
10 corresponding to look ahead requests. 

Video Programming 

The transmission of video programming is a classic problem in that it tends to 
occupy large amounts of bandwidth. The typical byte count for an hour of video 
1 5 programming, using available data compression techniques that do not noticeably 
degrade the quality of the program, is about 0.9 gigabytes, which is equivalent 
to about 15MB per minute. 

Due to the large amounts of data associated with video programming, in the 
20 preferred embodiment most video programming is transmitted on a separate 
channel. Using a preferred data transmission rate of 1 .5MB per second, which 
is equivalent to 90MB per minute, it is possible to use time multiplexing so as 
to transmit six video programs (each having an associated data rate of 15MB 
per minute) simultaneously. In addition to using time multiplexing, a data tiering 
25 arrangement similar to the one shown in Table 1 can be used, whereby certain 
types of video programming are repeated more frequently than others. 

Alternately, since the data rate of the channel is higher than the data rate required 
for any one video program, it would be possible to transmit a video program in 
30 a fraction of the playback time associated with the program. However, it is 
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anticipated that many subscriber's data receiver systems will not be able to 
receive and store downloaded data at a sustained rate of 1 .5MB per second (i.e., 
for a sustained period of time in excess of, say, two or three seconds). If the 
received video programming data is being recorded in a typical subscriber station 
5 on a tape recorder, then the multiplexing rate used for transmission would be 
selected so as to match the data recording rates of such tape recorders. 

It is anticipated that at least some subscriber's will be interested solely in 
receiving video programming from the data transmission system. For such 

10 subscribers, the subscriber's "computer" will actually be a data storage box which 
operates in much the same manner as a conventional video cassette recorder. 
For these systems, the data filter system would be preprogrammed to store all 
hierarchy information related to video programs. On screen menus would be 
displayed to allow the subscriber to select programs. The timestamp information 

15 embedded in the hierarchy information would be used to inform the subscriber 
as to when the selected programs would be received. Packet ID values 
corresponding to the selected programs would be stored in the filter list and the 
selected programs would be stored, generally on tape, for later viewing by the 
user. 

20 

Potential Publishers 

The data transmission system of the present invention is suitable as a publication 
medium for a wide range of potential publishers, including both publishers who 
wish to disseminate information for free and for publishers who wish to charge 
25 for each copy disseminated. The potential publishers include industrial suppliers 
(product data, catalogs, product information), software publishers (computer 
software and games, video and audio programming), catalog sales companies 
(text and pictures), advertisers, real estate sellers, travel packages, publishers 
of reference materials or subscription based materials, and many others whose 
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information changes relatively often or who wish to have low cost distribution 
to large numbers of customers. 

Alternate Embodiments 

Many aspects of the above described preferred embodiment might be modified 
to accommodate various commercial and technical requirements. For instance, 
dedicated optical fiber channels might used to transmit data from a national, or 
one or more regional, program supplier stations, to cable television transmission 
stations distributed over a wide geographic area. Certainly, the data transmission 
rates used by the system are totally dependent on the particular transmission 
medium used and the maximum acceptable cost for each subscriber's data filter 
subsystem. 

Numerous minor technical choices in the design of the system can be changed 
without affecting the overall merit of the data transmission system. For instance, 
in place of a standard, commercial error correction methodology, a proprietary 
error correction methodology could be used if there were a reason to do so. 

While the present invention has been described with reference to a few specific 
embodiments, the description is illustrative of the invention and is not to be 
construed as limiting the invention. Various modifications may occur to those 
skilled in the art without departing from the true spirit and scope of the invention 
as defined by the appended claims. 
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